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Abstract: 

In this paper we obtain an adjusted version of the likelihood ratio test for errors-in-variables multivariate linear regression 
models. The error terms are allowed to follow a multivariate distribution in the class of the elliptical distributions, which 
has the multivariate normal distribution as a special case. We derive a modified likelihood ratio statistic that follows 
a chi-squared distribution with a high degree of accuracy. Our results generalize those in Melo & Ferrari (Advances 
in Statistical Analysis, 2010, 94, 75-87) by allowing the parameter of interest to be vector-valued in the multivariate 
errors-in-variables model. We report a simulation study which shows that the proposed test displays superior finite 
sample behavior relative to the standard likelihood ratio test. 

Keywords: Elliptical distribution; Measurement error; Modified likelihood ratio statistic; Multivariate errors-in-variables 
model. 



1 Introduction 

Statisticians are often faced with the problem of modeling data measured with error. As an 
example, we refer to Aoki et al. (2001), who compared the effectiveness of two types of toothbrushes 
in removing dental plaque. One explanatory variable is the dental plaque index before toothbrushing 
and the response variable is the dental plaque index after brushing, the amount of plaque being 
imprecisely measured. The authors proposed a null intercept regression model that assumes that this 
explanatory variable is measured with an additive random error and the measurement error of the 
response variable is assumed to be absorbed by the error term of the model. 

Errors-in-variables models are generalizations of classical regression models. The true (non- 
observable) explanatory variables are treated either as random variables, in which case the model 
is said to be structural, or as unknown parameters, leading to a functional model. Structural mod- 
els are, in general, non-identifiable, while functional models induce unlimited likelihood functions. 
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Such difficulties disappear if some variances are assumed to be known (e.g. Chan &; Mak (1979) and 
Wong (1989)) or that the intercept is null (e.g. Aoki et al. (2001)). For details on errors-in-variables 
models the reader is referred to, for instance, Fuller (1987) and Buonaccorsi (2010). 

The most popular errors-in-variables models for continuous outcomes are based on normality 
assumptions. The family of the elliptical distributions provides a useful alternative to the normal 
distribution when outlying observations are present in the data. It nests the normal distribution, 
heavy-tailed distributions, such as the exponential power and the Student-t distributions, and light- 
tailed distributions. Further information on elliptical distributions can be found in Fang et al. (1990) 
and Fang and Anderson (1990). 

As shown by Melo & Ferrari (2010), statistical inference in errors-in-variables models based on 
first-order asymptotic approximations can be imprecise for small or moderate sized samples. In 
particular the type I error of the likelihood ratio test is often larger than the designed level of the 
test. Skovgaard (2001) proposed a general strategy to adjust the likelihood ratio statistic when interest 
lies in inference on a vector-valued parameter. The adjustment makes the resulting statistic to follow 
a chi-squared distribution with a high degree of accuracy. The adjustment is broadly general, but 
requires either some unusual likelihood quantities or the identification of a suitable ancillary statistic 
such that, when coupled with the maximum likelihood estimator, constitutes a sufficient statistic for 
the model. In the present paper, we obtain an appropriate ancilary statistic and derive Skovgaard's 
adjustment for a structural elliptical multivariate errors-in-variables model. 

The paper unfolds as follows. Section [2] introduces the model. Section [3] contains our main results, 
namely the ancillary statistic and an explicit formula for the modified likelihood ratio test. The finite 
sample behavior of the likelihood ratio test and its adjusted version is evaluated and discussed in 
Section HI Our simulation results clearly show that the likelihood ratio test tends to be oversized 
and its modified version is much less size-distorted. Finally, Section [5] closes the paper with our 
conclusions. Technical details are left for three appendices. 



2 The model 

The (Z + l) x 1 random vector Z is said to have a (/ + l)-variate elliptical distribution with location 
vector \i ((/ + 1) x 1), dispersion matrix S ((/ + 1) x (J + 1)) and density generating function po, and 
we write Z ~ Eln+i)(fJ,,E;po), if 

Z = n + AZ*, 

where A is (I + 1) x k matrix with rank(yl) = k, AA T = £ and Z* is a (I + 1) x 1 random vector 

with density function po(z T z), for z E The notation X = Y indicates that X and Y have 

the same distribution. It is assumed that y( l+l ^ 2 ~ l po{y)dy < oo. The density function of Z is 

p(z, /i, S) = |£|- 1/2 R) ((z - ») T ^-\z - nj) ■ (1) 
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Some special cases of (pQ) are the following multivariate distributions: normal, exponential power, 
Pearson II, Pearson VII, Student-t, generalized Student-t, logistic I, logistic II and Cauchy. The 
elliptical distributions share many properties with the multivariate normal distribution. In particular, 
marginal distributions are elliptical. For a full account of the properties of the elliptical distributions, 
see Fang et al. (1990, Sect. 2.5). 

We consider the following model, which consists of p independent errors-in-variables structural 
models: 

Yjk = a k + Pk x jk + e jk, ^ 
Xjk — %jk ~i~ l^jki 

for j = 1,2, ... , n k and k = 1,2, ... ,p, where Y jk = (Y ljk ,Y 2jk , . . ■ ,Y ijk ), a k = (a lk ,a 2k , ■ ■ -,ai k ), 
f3 k = {Pikifok, • • • , Afe) and e jk = (e ljk , e 2 j k , ■ ei jk ) T . Here, x jk is not observed directly. Instead, 
we observe Xj k , which is viewed as Xj k plus a measurement error, Uj k . We assume that ~E{ej k ) = 0, 
Var(e ifc ) = S efc = diag {a^, . . . , a\ J and Cov(e fk ,, e jk ) = 0; E(u jk ) = 0, Var(u jfc ) = a\ k 
and Cov(ujf kl ,u jk ) = 0; E(x jk ) = fi Xk , Vai{x jk ) = a 2 Xk and Cov(xji k >,Xj k ) = 0; Cov(x j/k > ,u jk ) = 0; 
Cov(e fk ,,u jk ) = Cov(e j/k ,,x jk ) = 0, with (j',k r ) / (j,k). 

Model ([2|) can be written as 

Zjk = $k + Akbjk, (3) 
for j = 1, 2, . . . , n k and k = 1, 2, . . . ,p, where 

where is the identity matrix of dimension /. We assume that, for each k = 1,2, ... ,p, the errors 
bik, b 2k , ■ ■ K k k are independent and b jk ~ El( l+2 ){r] k , tt k ;p ), with 




< 






















< 



Ilk = 

Therefore, for each k = 1, 2, . . . ,p, the random vectors Z\ k , Z 2k , . . . , Z nkk are independent and Zj k ~ 
Eli + i(/j, k ,'E k ;po), with fi k = S k + Ajt?7 fc and = A k ^l k A^ (Fang et al. 1990, Sect. 2.5). We can 
write fj, k and T, k as 

V / V a x k P k a x k + / 



Regression model ([5]) generalizes the normal structural models proposed by Cox (1976) (I = 1) and 
Russo et al. (2009) (I = 2), and the elliptical structural model considered in Melo & Ferrari (2010) 
(I = p = 1); a closely related model is presented by Garcia- Alfaro & Bolfarine (2001). As expected, 
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model ([2]) is not identifiable because the relation between the parameters of the distribution of Zj k 
and OtjA = (Pj , Oik, ^x k , cr^ fc , o\ k , f"eJ) T is not unique. Assumptions on a Xk and a-\ k are usually 
imposed to overcome identifiability problems. It is common to assume that the X Xk = &x k / a u k or 
X e - k = e-fc/°tLj f° r * = lj 2, ... ,1, is known. An alternative assumption is that the intercept a k is 
known (see Aoki et al., 2001). Under each of these identifiability assumptions we have: 

(i) if X Xk is known, 

a _ ( aT n ,. Jl „2 T \ T Q i v _ ( PkPk k °u k + S e fc Pk^x k a l k V 

*(*) - ((3 k ,a k ,» Xk ,* Uk ,cr ek ) and S fc - ^ ^ + ^ j , 

(ii) if A efe is known, 

— f fitT n ,. J2 Jl \ T QT1 J ^ — f Pk a x k Pk + ^e k °1 k Pk a l k \ 
V{k) - \Pk, a k,Vx k ,V Xk ,V Uk ) and ^fe- fc o aT 9 , *2 > 

V 7 V a x k Pk a X k + °U fe / 

with A efe = diag { X eik , X e2k , . . . , X eik } ; 

(iii) if a is known, 

a — (at „ _2 2 ^2 tN \ T „i v _ / Pk a l k Pk + S e fc /^fc "^ A 

(fe) - , M„ , ^ , , <x 6fc j and E fc - ^ ^ ^ + ^ j . 

The independent structural elliptical model can be defined in terms of the density function of 



Z (i)' Z (2)' • • • » Z (p)) > with Z (k) = i z ik, z 2k, ■■■■> z n k k) ' , for A; = 1, 2, . . . ,p, which is given 

by 

P n k 

P z(z,o) = n n \^kr i/2 p (dj k ^ i d jk ), 

k=Xj=l 

where ef,- fc = d jk (9^) = z jk -fi k , for j = 1, 2, . . . , n k , k = 1, 2, . . . ,p, and = ^0^,0^, . . . ,0^ 
The log-likelihood function for the k-th group, k = 1, 2, . . . ,p, is given by 

4(0, z) = -y log|S fc | + ^logp (4 fc S fc 1 d jfc ). 

i=i 

For a sample of size n = X^aUi n fc an d p populations, the log-likelihood function is 

v 

i(e,z) = Y t h{e,z). (4) 

k=l 

Maximum likelihood estimation of the parameters can be carried out by numerically maximizing the 
log-likelihood function through an iterative algorithm such as the Newton-Raphson, the Fisher 
scoring, EM or BFGS. Our numerical results were obtained using the library function MaxBFGS in the 
Ox matrix programming language (Doornik 2006). 
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3 Ancillary statistic and modified likelihood ratio test 



The parameter vector is partitioned as = {if> T ,u) T ) T , with if) representing the parameter of 
interest and u) the nuisance parameter. Our interest lies in testing T~Lq : if) = if)^ versus Hi : if) ^ if)^°\ 
where if)^ is a g-dimensional parameter of known constants. The maximum likelihood estimator of 6 

is denoted by 6 = {if) ,o) T ) T and the corresponding estimator obtained under the null hypothesis is 

6 = {if) , cj t ) t , where if) = if)(°\ We use hat and tilde to indicate evaluation at 6 and 6, respectively. 

The likelihood ratio statistic for testing Jio is given by 



LR = 2 {£{$)- £{ij))} . 



Under Hq, LR converges to a chi-square distribution with q degrees of freedom, where q in the 
number of restrictions imposed by Hq. This approximation can be improved if one applies a suitable 
adjustment to the test statistic. Skovgaard (2001) proposed two adjusted likelihood ratio statistics 
that are asymptotically equivalent for testing We shall denote them by LR* and LR** . The 
adjustment terms depend on a suitable ancillary statistic and involves derivatives with respect to 
the sample space. A statistic a is said to be an ancillary statistic if it is distribution constant and, 
when coupled with the maximum likelihood estimator 6, is a minimal sufficient statistic for the model 
(Barndorff-Nielsen (1986)). If {6, a) is sufficient, but not minimal sufficient, Skovgaard's results still 
hold; see Severini (2000, Sect. 6.5). In this case, the log-likelihood function depends on the data only 
through {6, a) and we write £{9;0,a). The sample space derivatives involved are £' = d£{6;6,a)/d6 
and U' = d 2 £{0-d,a)/d0d0 T . The adjusted statistics are given by 

LR* = LR (l--^ log pj (5) 

and 

LR** = LR -2 log p, (6) 

with 



_ | j ,l/2|fr/|-l| T 1 1/2 , y 1-1/2. r |l/2 i U J U V /2 

LRil 2 ~ x {£' — £') T {U')~ l U 



(7) 



Here J equals d 2 £{6;6,a)/d6d6 T evaluated at 6 = 6 and = 6. Also, J wu is the lower right 

submatrix of J that corresponds to the nuisance parameter u. Both statistics have an approximate 
X 2 distribution with high degree of accuracy under the null hypothesis (Skovgaard, 2001, p. 7). 

Let a = a{z) = (aj^z), . . . , aj p) {z^j , where aj k) (z) = (aj k {z), . . .,0^(2)) , with a jk {z) = 
P k ~ 1 {z) {zjk — p, k {z)) , j = 1, 2, . . . , rifc, k = 1,2, ... ,p, where is a lower triangular matrix such 
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that PkPk = is the Cholesky decomposition. Following Melo & Ferrari (2010) it can be shown 
that a is an ancillary statistic. With this ancillary statistic we can obtain the sample space derivatives 
which are required for the computation of the adjustment term p. 

In the following we present some matrices and vectors that form the score U (Appendix A), the 

observed information matrix J (Appendix A) and the sample space derivatives £', U' and J (Appendix 
B). In matrix notation, we have U = {U([\, . . . , Uj^) 1 , J = diagjj^, . . . , £' = ■ ■ ■ j 

U' = di&g{U' {1) , . . . , U' (p) } and J = diag{ J (1) , . . . , J (p) }, with U {k) = n* (k) + Rj k) h {k) , J {k) = 

J T (k)-Rj k) M (k) -V { T k) Q {k) , i\ k) = 2Rj k) w (k) , U\ k) = 2(Rj k) B (k) +V^C {k) ) and J (k) = 2(Rj k) F (k) + 

(T)^(fe)) - ^"^h element of n>( k \ is tr(S^ 1 S ( - fc - ) j), for i = 1,2, ... ,s and k = 1,2, ... ,p. Here, s is 
the total number of parameters in 6( k y When the ratio X Xk or the intercept a k is known, we have 
s = 21 + 3, and when the ratio A efe is known, s = I + 4. The (i, i')-th element of T( fc ) is 

t(k)ii> = tr(S (fe)i S( fc )j') + tr(S fe _1 S( fe )jj/), 

where E (fe)i = <9S fc /<9(9 (fc)i , Y, {k)ii , = dY, {k)i / d6 {k)i , and Y>^ % = dY^ 1 / 86 ^ = -Y lk 1 T,^ k)i T, k 1 , for i, i' = 
1,2, ... ,s. Here, 9( k )i is the z-th element of 6 k ; see Appendix C. Also, RnA and Vm are block-diagonal 
matrices given by P( fe) = diag(r (fc) , r (fc) , . . . , r (fc) ) and Vq.) = diag(t> (fc ), v (fc ), . . . , t> (fc) ), with dimension 
sn k x s, and j-th element of the vectors and given by r^j = Wp (dJ k T; k l dj k ) and 



T 



Wp (dj k T, k djk), respectively. Additionally, we define the column vectors = [h^ , ■ ■ ■ , 

and W(ty = (^ w ^ > ■ ■ ■ j w q!) ) > w hh dimension sn^, and j-th element of the vectors and 
for i = 1, 2, . . . , s, given, respectively, by 

and 



(») 
it;;/ . 



where P^ = dPk/d6^)i and = dp, k /d9^ i . The derivative P^ is obtained through the 

algorithm proposed by Smith (1995) and the derivative fir^i is presented in Appendix C. The block 
matrices P>r k \, Cr k \, F( k ), Gn.\, Mt k \ and Q( k \, with dimension sn k x s, have the (i, i')-th block given, 
respectively, by the vectors b 1 ^ , c 1 ^ , , g 1 ^ , m 1 ^ and q 1 ^ . The j-th elements of these vectors 
are, respectively, 

bf k )j = (P(fe)i' o-jk + fi(k)i') T ^ k ^(RkO-jk + fik~ Mfc) ~ M^i^fe 1 {P{k)i> a jk + fi(k)i'), 

c ?k)j = (P(k)i' a ik + /i (fc) j/) T S^ 1 (Pfeaj fe + Uk~ Mfc) [(Pfeflife + Afc - /i fc ) T S (fc)i (Pfca jfc + - /i fc ) 

- 2 t J -Jk)i T 'k' 1 {Pkajk + fik- Vk) . 

•^Wi = a J fc + fi(k)i>) T ^ (k)t Pk a jk ~ J^Jk)i^k l (P(k)i' a jk + #(fc)i')> 
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9(k)j = { p {k)i> a jk + fJ-(k)i') T ^k lp k a jk (a]k P k^ k)lp ka.jk - 2fiJ k ^T lk 1 P k a jk ^ , 
m% tk)i = d Jfc s(fc)M d jk ~ 2M( ^ fe )iS ( " :), d jk - 2^J k)i ,Y, ( - k)l d jk - 2/iJ. )ii ,S fe " 1 d jk + V(fc)i'> 

= (4* s(fc)<d i* " ^SjiSfc" 1 ^*) (4 fc S (fc)i 'd jfc - 2 / xJ fc)i ,£ J r 1 d i *) , 

where /z (fc)ii , = 8^ k)i /d9 {k)il and £«"' = dY,^ /dO {k)v = -2£( fc )*£ {fc)i ,£ fc 1 - E^E^ET/ 1 ; see 
Appendix C. 

By replacing J, J^, J^lh, J, U, U', £' — £' and the likelihood ratio statistic LR in (|7|) we obtain 
p, the quantity that is required for computing the adjusted statistic LR* in ([5]) and its equivalent 
version LR** given in ([6]). Note that p depends on Zjk, n k , S^ 1 , -Pfc and their first and second 
derivatives with respect to the parameters. It is worth mentioning that the distribution of Zj k is only 
required for obtaining the matrices Rr^ and Vha. 

As a final remark, we mention the connection between our results and those obtained by Melo & 
Ferrari (2010). In their paper, the model under study is the special case of model ([2]) when I = p = 1. 
The authors obtained the Barndorff-Nielsen (1986) adjustment to the signed likelihood ratio statistic 
for testing hypotheses on a scalar parameter. The adjustment term, given in eq. (6) of their paper, 
can be calculated using the quantities obtained in the present paper for the case in which tp is scalar 
(q = 1). Therefore, our results enables us to calculate Barndorff-Nielsen's (1986) adjusted signed 
likelihood ratio statistic in model ([2]). Hence, our results generalize those in Melo & Ferrari (2010). 

4 Simulation study 

In this section we present a Monte Carlo simulation study to evaluate the efficacy of the adjust- 
ments derived in the previous section. The performances of the tests that use the likelihood ratio 
statistic (LR), and the adjusted statistics (LR* and LR**) will be compared with respect to the type 
I error probability. 

The simulations use model (|3|) with I = 1 and p = 5. Two different distributions for Zjk are 
considered, namely a bivariate normal distribution and a bivariate Student-i with 3 degrees of freedom 
{y = 3). The number of Monte Carlo replications was 10,000, the nominal levels of the tests are 
7 = 1%, 5% and 10% and the sample sizes are n\ = . . . = n p = 10, 20, 30 and 40. All simulations were 
performed using the Ox matrix programming language; Doornik (2006). 

We consider tests of Ho : if) = ip^ versus T-L\ : i/> ^ where if) = (f3±, fa, ■ ■ ■ , /3 g ) T , 

for q = 2,3, 4, 5. Also, we consider = when X Xk or A efe is known, and = 1 when the 
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intercept is null. The true parameter values are a± = ■ ■ ■ = as = 0.5, a xi = • • • = a X5 = 1.5, 
a\ x = ■■■ = al 5 = 0.5, a\ = ■ ■ ■ = a\ h = 2.0. For A Xfc or A 6fc known, we set [i Xl = ■ ■ ■ = \i Xh = 0.5, 
and when the intercept is null we set (i xi = • • • = fj, xs = 5.0. 

Tables 1 and 2 present rejection rates (in percentage) of the three tests for all the scenarios 
described above. We notice that the likelihood ratio test (LR) is liberal when the sample size is small 
in all the cases considered here. For instance, when Zjk is normally distributed, q = 3, A efc is known 
and rife = 10, the rejection rates of the test that uses LR are 11.4% (7 = 5%) and 19.4% (7 = 10%); 
see Table 1. Under the same scenario, except that Zj\. now follows a Student-t distribution, the 
rejection rates are 10.9% (7 = 5%) and 18.6% (7 = 10%). The adjusted tests (LR* and LR**), on 
the other hand, display much better behavior in all cases: they are much less size distorted than the 
likelihood ratio test. For example, in the normal case with \ Xk known, = 10 and 7 = 10%, the 
rejection rates are 16.9% (LR), 10.2% (LR*) and 9.8% (LR**) for q = 2, and 18.6% (LR), 10.2% 
(LR*) and 9.5% (LR**) for q = 3. As a second example, we mention the case in which the underlying 
distribution is normal, the intercept is null, q = 5, n& = 10 and 7 = 5%. The rejection rates are 
9.3% (LR), 5.2% (LR*) and 5.0% (LR**). Also, for the normal case with A Cfc known, n k = 20 and 
7 = 1% the rejection rates are 1.9% (LR), 1.1% (LR*) and 1.0% (LR**). It can be noticed that, 
as the number of parameters under test (q) grows, the likelihood ratio test deteriorates while the 
behavior of the adjusted tests remains unaltered. See, for example, the figures in Table 1 relative to 
the Student-t case with A efe known, = 10 and 7 = 10%; the rejection rates are 18.4% (q = 2), 
18.6% (q = 3), 19.9% (q = 4) and 21.3% (q = 5) for LR, 11.4% (q = 2), 10.4% (q = 3), 10.8% (q = 4) 
and 11.0% (q = 5) for LR*, and 10.9% (q = 2), 10.1% (q = 3), 10.2% (q = 4) and 10.1% (q = 5) for 
LR**. 

[Tables 1 and 2 here] 

Our numerical results confirm that the adjusted tests are much better behaved than the original 
likelihood ratio test in small samples. For almost all the cases, the test that uses the LR** displays 
slightly better performance than its asymptotically equivalent version, LR*. 

5 Concluding remarks 

In this paper we dealt with the issue of performing hypothesis testing in an elliptical multivariate 
errors-in-variables model when the sample size is small. We derived modified likelihood ratio statistics 
that follow very closely a chi-squared distribution under the null hypothesis. Our approach is based 
on Skovgaard's (2001) proposal, which requires the identification of a suitable ancillary statistic. We 
obtained the required ancillary and all the needed quantities to explicitly write the correction term. 
Our simulation results clearly suggested that the adjustment we derived is able to correct the liberal 
behavior of the likelihood ratio test in small samples. 
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Appendix A. The observed information matrix 

The first derivative of the log-likelihood function for the fc-th group, k = 1,2, ... ,p, with respect to the parameters 

is 

- -^tr (E fc _1 E (fc)i ) + y~] W P0 (dJkE^djk) (dJ k Y, {k)l d jk - 2fi {k)i T, k ~ 1 d jk ^ , 



<:)() 



(k)i 



for i = 1,2, . . . ,s, where W P0 (u) = 8 \ogp (u)/8u, fi (k)i = 8^ k /89 (k)i , T,( k)i = dT, k /d9 {k)i and E (fc)l = 8T, k 1 /89 {k)i = 
— Efc~ 1 E( fc ) i Efc~ 1 . The (i,i')-th element of the observed information matrix for the fc-th group, J( k ), is given by 
J(k)a' = —d 2 lk{d)/d9( k - )i dQ( k ) i i, i.e. 

JWW = Y tr ( E(fc)!E W-') + Y tr ( E *" lE W«') " E \ W po (<*J* E * 'd : .) (d] k ^d ]k 

- 2 M{fc)i S fc~ ld j'fe) (d7feS (fc)l djfe - 2iiJ k)l ,Y,k~ 1 djk^j + W po ^dJfeEfe -1 djkj ^dJ fe E (fe)M dju 

- SMCfejiS^' 1 djfe - 2fj,J k)l ,T, ik)l dj k - 2nJ k)ll ,Y> k ~ 1 d jk + 2/zJ )j E*rV(fc)i' ) j, 

for = 1,2, . . . ,s and fc = 1,2, . . . ,p, where W PQ (u) = 9W P0 (w)/du, /Lt (fc)M / = 8(i (k)i /d9( k )i' , E (fe)ii / = 8Y; {k)i /d9 (k)il 
and E<*>"' = dZ Wi /dO {k)i , = -^E^'E^Efe" 1 - E^E^E^" 1 . 

Appendix B. Sample space derivatives (^', U' and J) 

Let a be the ancillary statistic defined in Section [3] and let us write Zj k = P k aj k + fi k . Inserting Zj k in the 
log-likelihood function we have 

£ k (0;§,a) = -^log|E fe | +^logp {{Pkajh + fi k - fJ, k ) T Y< k ~ 1 (P k a jk + fi k - H k , 



Hence, £' = 81(8; , a) /80 = Ufa , , . . . , l'^ J , where the i-th element of the vector l[ k) is 

t'(k)i = 2 E W po ((-Pfc a ifc + Mfc - M fc ) T E fe " 1 (P fe a J fc + Mfe - Mfe)) (-P(fc)ifljfc + M(fc) l ) T Sfc _1 (P fc a j fc + £ fc - /x fc ), 
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for i = 1,2,..., s and k = 1,2,..., p. Also, we have that U' = d 2 £(O;O,a)/d§d0 T = diagjV^, U[ 2) , 
. . . , U'qp) |, where the (i, i')-th element of the matrix U' is given by 



U\ k )ii' = 2^ S Wpo (PkO-jk -Mfc) T S fc ^^Oj-k + Mfc - Mfc)) ((^(*)i'Oj* + £(fc)i') TsWl (-Pfc a i*; + fi k 
i=i { 

- Mfc) ~ ^{fc)iSfe _1 (P (fe)4 /a jfe + M(fc)»')) + Wp ((-Pfc a jfc + Mfc - Mfc) T S fe _1 (Pfca jfc + £ fe - M fc )) 
{P {k)i ,a jk +fi (k)i ,) T T, k ~ 1 (Pka :jk +tl k - [(Pkajk + fi k - fi k ) T E (k) \P k a jk + £ fe - /x fc ) - 2/x 



(*)» 



where P( k )i = dP k /Q( k )i, for £ = 1,2, ... ,s and fe = 1, 2, . . . ,p. We also have that 



~_ d 2 e(0;9,a) 



oeoe 1 



diag| J(i), J( 2 ), . . • , J(p)|, 



where the (i, i')-th element of J( fc j is given by 

J(k)W = 2^< W P0 (d^E^d,*,) ((-P(fc)i'Oifc + M(fc)i') T £ (fe)! -Pfc a jfc -fiJvi^iPwi'ajk +M(fc)i' 
»=i I 

+ ( d Jfe^fc (P(k)i>a jk + fi ik)i ,) T: £ k 1 Pka jk (^aJ k P k Il {k)l P k a jk - 2flJ k)i T,~ 1 P k a jk ) |. 

In matrix notation we have 

* w =2^)11;^), =2(i$ ) B w +Vj ) C w ) and J w = 2 + V ( I)G W ) ; 

the elements of the matrices B^ k ), C( k ), Fr k ), G(k) and of the vector W(k) are defined in Section [3] for k = 1, 2, 



Appendix C. Derivatives of the vector fik and of the matrix with 
respect to the parameters 

When the ratio X Xk = a 2 k /a 2 . k is known the first derivative of fj, k has elements 

( (0,...,0, ,0,...,0) T , if i= 1,2,...,/ 

position i 



= < 



(1\0) 

(/£,!) 
0, 



if i = I + 1 
if i = Z + 2 

if i = Z + 3,Z + 4, .. . ,s. 



The first derivative of Efc with respect to the parameter vector 0( k ) is now given for i = 1,2, ... ,s: 
• if i — I + 1 or i = Z + 2, then Eu,)j is null; 
. if i = Z + 3, we have E (fc)i = f Pf*& f 



(8) 
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is i = I + 4, 1 + 5, . . . , s, the elements of are null except for the (i — I — 3, i — I — 3)-th element, which is given 
by 1. 



if i = 1, 



, / , we have 



2 

x k u u k 



l3ik^x k o-"t k \x k &t k \ 




x k "u k 






fiikXx k a Uk 



if i = 1 





) 



\ 




I V 



Plk\x k O~ u $2k\x k w if 



2Pik\x k crl k Xx h cr Uk 











2 

o " / 



if i = I. 



The second order derivative of /i fc is 

r (o,...,o, i 



M(fe)ii' 



= < 



,0, ...,0) T , if i = 1,2, . . . ,Z and i' = I + 2 



position j 



(0, ...,0, 1 ,0, ...,0) T , if i = / + 2 and i' = 1,2, 



(9) 



position i' 
0, 



otherwise. 



For i, i' = 1, 2, . . . , s, the elements of the matrix Yj(k)w are null except for the following cases: 

• if i = 1, 2, . . . , I and i' = 1,2, ... ,1, we have that, i — i! , the elements of ^(k)a are null except the (i, i)-th element, 
which is equal to 2\ Xk a^ k . When i ^ i' , the elements of the matrix are null, except those in positions 



and which are equal to X Xk cr^ 

if i = 1, . . . , I and i' — I + 3 we have 



/ 2/3ik\x k P2k^x k 
fck\x k 



-'(k)ii' 



o o 



/3ik\x k 
V A,, 



/ 




Plk\x k 
$2k^x k 



if j = 1 





/ 



\ 





/3lfcA Xfc P2k\x k 

V 



if i = /. 



/ 



When the ratio A efc is known the first and second order derivatives of fi k are given, respectively, in ((8]) and ([SJ, 
with s = Z + 4. The derivative of with respect to the parameter vector On,.) is 
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• if i = 1, . . . , Z we have 



-"(fc)i 



V o o 



• if i = Z + 1 or i = Z + 2, the matrix E( fc )j is null 



Pik&1 k 



Pikcr1 k 






Pike1 k 
fok&i k 



if i = 1 





/ 



\ 




o / 



if i = Z; 



• if i = I + 3, S (fc)i = ^ 



• if i = Z + 4 = s, E (fc)l = 



A efc 
1 



For — 1,2, ... ,s, we have that the elements of E^^/ are null except for the cases: 
• if i = 1, . . . , Z and i' — I + 3, we have 



fok 



A* 
V i 



j (fc)»»' 



-"(fc)i'i 



/ 







A* i \ 
o o 



o o 
o o / 



Pik o \ 

/3 2fc 



if i = 1 



/3ife /32i 

V o o 



1 



1 

o / 



if i = Z; 



• if i = 1, 2, . . . , Z and i! = 1, 2, . . . , Z, we have that, for i — i' , the elements of the matrix are null except 

the (i, i)-ih elements, which is equal to 2cr Xfe . When i =^ i', the elements of the matrix are null except the 

(i,i')-th and the (i',i)-th elements, which are equal to a^ k . 

When the intercept a k in known the first order derivative of fi k is 

{ (0,...,0, ^ ,0,...,0) T , if i = l,2,. ..,1 
position i 



M(k)» - 



(/^,1) T , 

0, if i = Z + 2,Z + 3, ...,s. 

The derivative of E fc with respect to 0(k) is now presented for i = 1, 2, . . . , s. We have 



if 



Z + l 
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• if i = I + 1, the matrix E( fc )j is null; 

• if i = I + 2 we have 



• if i = Z + 3 we have £( fe ); = 





1 



if i = I + 4, 1 + 5, . . . , s, the elements of the matrix £( fe ); is null except for the (i — I — 3, i — I — 3)-th element, 
which equals 1; 



• if i = 1, . . . , Z, we have 



-(fc)i 



Afc^ fe °l k \ 




/ 

o 



if z = 1 





/ 



\ 








if i = I. 



/ 



V o 

The second order derivative of fj, k is 

(0, ...,0, ^ ,0, ...,0) T , if i = 1,2,..., I and i' = 1 + 1 
position i 



M(fe)i 



(0, ...,0, 1 ,0, ...,0) T , if i = l + l and i' = 1,2, . . . ,1 



position i 
0, 



otherwise. 



For = 1,2, s, we have that the elements of Effcw are null except for the cases: 

• if i = 1, 2, . . . , I and i' = 1,2, ... ,1, we have that, for i = i' , the elements of are null except for the (i, i)-ih 
element, which is equal to 2 a X/ _. When i ^ i' , the elements of £(*,)»;' are null except for those in positions (i, i') 

and which are equal to a Xk ; 

• if i = 1, . . . , I and i' = t + 2, we have 



J (k)ii' — ^(fc)i'i 



I 2p\u P2k 
fck 



V i 



/ o o 
o o 



Plk f$2k 
\ 








/ 



Pik o ^ 

P2k 



if z = 1 



2pik 
1 



1 

/ 



if i = I. 
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Table 1: Null rejection rates; nt = 10. 



X Xk known 

Normal distribution Student-i distribution {y = 3) 



7 = 5% ' 7 = 10% 7 = 5% ~ 7 = 10% 



Q 


LR 


LR* 


LR** 


LR 


LR* 


LR** LR 


LR* 


LR** 


LR 


LR* 


LR** 


2 


10.1 


5.1 


4.9 


16.9 


10.2 


9.8 


9.9 


5.2 


5.0 


16.7 


10.6 


10.2 


3 


11.6 


5.3 


5.0 


18.6 


10.2 


9.5 


10.7 


5.2 


4.9 


18.4 


10.3 


9.8 


4 


12.5 


5.3 


4.8 


20.4 


10.2 


9.6 


12.0 


5.4 


5.1 


19.9 


10.6 


10.1 


5 


13.6 


5.2 


4.8 


21.5 


10.3 


9.6 


12.7 


5.5 


5.1 


20.8 


10.4 


9.8 
















known 













Normal distribution Student-i distribution (y = 3) 

7 = 5% 7 = 10% 7 = 5% 7 = 10% 



9 


Li? 


LR* 


LR** 


LR 


LR* 


LR** 


LR 


LR* 


LR** 


LR 


LR* 


LR** 


2 


10.1 


5.1 


4.9 


17.3 


10.1 


9.6 


10.7 


5.8 


5.5 


18.4 


11.4 


10.9 


3 


11.4 


5.3 


4.9 


19.4 


10.1 


9.6 


10.9 


5.3 


5.0 


18.6 


10.4 


10.1 


4 


13.0 


5.2 


4.7 


20.8 


10.7 


9.8 


11.9 


5.5 


5.2 


19.9 


10.8 


10.2 


5 


13.7 


5.2 


4.7 


22.4 


10.3 


9.6 


13.2 


5.5 


5.2 


21.3 


11.0 


10.1 



null intercept 

Normal distribution Student-i distribution (y = 3) 

7 = 5% 7 = 10% 7 = 5% 7 = 10% 



5 


Li? 


LR* 


LR** 


LR 


LR* 


LR** 


LR 


LR* 


LR** 


LR 


LR* 


LR** 


2 


7.8 


5.1 


4.9 


13.9 


10.2 


10.0 


7.3 


5.5 


5.4 


13.3 


10.6 


10.5 


3 


8.6 


5.0 


4.9 


15.2 


10.2 


10.0 


7.4 


5.2 


5.2 


13.6 


10.4 


10.2 


4 


8.9 


5.1 


5.0 


15.7 


10.2 


10.0 


7.3 


4.8 


4.7 


13.9 


10.2 


10.0 


5 


9.3 


5.2 


5.0 


16.4 


10.1 


9.8 


7.9 


5.1 


5.0 


14.6 


10.3 


10.2 
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Table 2: Null rejection rates; q — 3. Normal distribution. 













known 












7=1% 






7 = 5% 






7 = 10% 


nk 


LR 


LR* 


LR** 


LR 


LR* 


LR** 


LR 


LR* LR** 


10 


3.6 


1.1 


1.0 


11.6 


5.3 


5.0 


18.6 


10.2 9.5 


20 


1.9 


0.9 


0.9 


7.8 


5.1 


5.1 


13.8 


10.0 9.9 


30 


1.4 


1.0 


0.9 


6.3 


4.7 


4.7 


12.3 


9.7 9.6 


40 


1.3 


0.9 


0.9 


6.0 


4.8 


4.8 


11.7 


9.9 9.9 












known 












7=1% 






7 = 5% 






7 = 10% 


nk 


LR 


LR* 


LR** 


LR 


LR* 


LR** 


LR 


LR* LR** 


10 


3.5 


1.2 


1.0 


11.4 


5.3 


4.9 


19.4 


10.1 9.6 


20 


1.9 


1.1 


1.0 


8.0 


5.2 


5.1 


14.6 


10.4 10.3 


30 


1.4 


0.9 


0.9 


6.9 


5.2 


5.2 


12.4 


10.0 10.0 


40 


1.3 


0.9 


0.9 


6.0 


4.8 


4.8 


11.7 


9.9 9.9 


null intercept 






7=1% 






7 = 5% 






7 = 10% 


nk 


LR 


LR* 


LR** 


LR 


LR* 


LR** 


LR 


LR* LR** 


10 


2.1 


1.1 


1.0 


8.6 


5.0 


4.9 


15.2 


10.2 10.0 


20 


1.5 


0.9 


0.9 


6.0 


4.5 


4.5 


11.9 


9.8 9.7 


30 


1.3 


1.1 


1.1 


6.0 


5.1 


5.1 


11.6 


10.0 9.9 


40 


1.4 


1.1 


1.1 


5.6 


4.9 


4.9 


11.1 


10.1 10.1 
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